An Efficient Global Optimization Approach for Rough Set Based Dimensionality Reduction
نویسندگان
چکیده
The theory of rough set, proposed by Pawlak, provides a formal tool for knowledge discovery from imprecise and incomplete data. Dimensionality reduction (RED) is a crucial problem for rough set based data mining. Unfortunately, it proves to be a NP-hard problem. Many reduction algorithms based on heuristic information have been developed, but they are all based on Boolean space {0,1}m. In this paper, we introduce the Universal RED problem model, or UniRED, which transforms the discrete attributes reduction problems on Boolean space {0,1}m into continuous global optimization problems on real space E. Based on this transformation, we develop coordinate gradient descent algorithm RED2.1, and coordinate direct descent algorithm RED3.1 for attributes reduction problems. In order to investigate the robustness and efficiency of proposed algorithms, we execute our algorithm RED3.1 on problems from UCI. Meanwhile, the comparison between algorithm RED3.1 and other famous reduct algorithms, such as dynamical reduct algorithm and genetic reduct algorithm, is presented. The experimental results indicate the robustness and efficiency of our algorithms.
منابع مشابه
A Dimension Reduction Approach to Classification Based on Particle Swarm Optimisation and Rough Set Theory
Dimension reduction aims to remove unnecessary attributes from datasets to overcome the problem of “the curse of dimensionality”, which is an obstacle in classification. Based on the analysis of the limitations of the standard rough set theory, we propose a new dimension reduction approach based on binary particle swarm optimisation (BPSO) and probabilistic rough set theory. The new approach in...
متن کاملA New Approach for Knowledge Based Systems Reduction using Rough Sets Theory (RESEARCH NOTE)
Problem of knowledge analysis for decision support system is the most difficult task of information systems. This paper presents a new approach based on notions of mathematical theory of Rough Sets to solve this problem. Using these concepts a systematic approach has been developed to reduce the size of decision database and extract reduced rules set from vague and uncertain data. The method ha...
متن کاملNature Inspired Multi-Swarm Heuristics for Multi-Knowledge Extraction
Multi-knowledge extraction is significant for many real-world applications. The nature inspired population-based reduction approaches are attractive to find multiple reducts in the decision systems, which could be applied to generate multi-knowledge and to improve decision accuracy. In this Chapter, we introduce two nature inspired populationbased computational optimization techniques namely Pa...
متن کاملRough Feature Selection for Intelligent Classifiers
The last two decades have seen many powerful classification systems being built for large-scale real-world applications. However, for all their accuracy, one of the persistent obstacles facing these systems is that of data dimensionality. To enable such systems to be effective, a redundancy-removing step is usually required to pre-process the given data. Rough set theory offers a useful, and fo...
متن کاملA Study on Rough Set Theory for Medical Image Segmentation
Rough set is approximate representation of a crisp set. Rough set theory provides an approach to approximation of sets that leads to useful forms of granular computing. Several applications have revealed the need to extend the traditional rough set approach. A special place among various extensions is taken by the approach that replaces the relation based on equivalence with a tolerance relatio...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006